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Abstract. 

In spite of the recent quick growth of the Evolutionary Multi-objective 
Optimization (EMO) research field, there has been few trials to adapt 
the general variation operators to the particular context of the quest for 
the Pareto-optimal set. The only exceptions are some mating restrictions 
that take in account the distance between the potential mates - but con- 
tradictory conclusions have been reported. This paper introduces a par- 
ticular mating restriction for Evolutionary Multi-objective Algorithms, 
based on the Pareto dominance relation: the partner of a non-dominated 
individual will be preferably chosen among the individuals of the pop- 
ulation that it dominates. Coupled with the BLX crossover operator, 
two different ways of generating offspring are proposed. This recombina- 
tion scheme is validated within the well-known NSGA-II framework on 
three bi-objective benchmark problems and one real-world bi-objective 
constrained optimization problem. An acceleration of the progress of the 
population toward the Pareto set is observed on all problems. 



Introduction 

The idea of restricted mating is not new in Evolutionary Computation: Gold- 
berg already suggested to forbid, or at least restrict, the crossover between 
too different individuals (i.e. that are too far apart for some distance on the 
genotypic space) - which makes sense for single-objective problems as soon as 
the population has started to accumulate on different fitness peaks, as recom- 
bining individuals from different peaks would often lead to lethal individuals. 
This idea has been transposed in the framework of Evolutionary Multi-objective 
Algorithms (EMAs) by Hajela and Lin [S], and by Fonseca and Fleming j^j. Nev- 
ertheless, Zitzler and Thiele ^7] did not observe any improvement when mating 
similar individuals. On the other hand, Horn et al. [5] present an argument 
supporting mating of dissimilar individuals: in the multi-objective framework, 
because the population diversity is enforced, the information provided by very 
different solutions can be combined in such way that a new type of (good) com- 
promises can hopefully be obtained. Nevertheless, Schaffer reported the absence 



of the improvement when mating dissimilar individuals. To sum up, no clear con- 
clusion can be drawn from existing experiments on the usefulness of restricted 
mating based on the (dis) similarity between mates. 

On a real-world design problem, using a very specific representation, Wild- 
man et Parks have investigated different pairing strategies based on maxi- 
mizing or minimizing different similarity measures. In particular, the similarity 
in the sense of the dominance rank has been considered, and enforcing the mating 
of the individuals from the elite archive with the individuals from the population, 
in an archive-based EMA, has been observed to be beneficial. 

However, in all studies enumerated above, the efficiency of the proposed 
mating restrictions has been measured by the quality of the final non-dominated 
solutions, without addressing the issue of computational time. In this paper, 
we propose a restricted mating strategy whose main effect is to accelerate the 
progress of the population of an EMA toward the Pareto set. The main idea is 
fairly simple, and consists in using the Pareto dominance relation when choosing 
a mate for the best (non-dominated) individuals. Note that a more detailed 
presentation (in French) can be found in |1U| . 

The paper is organized as follows. Next section briefly introduces evolu- 
tionary multi-objective optimization, and describes in more detail the NSGA- 
II algorithm, one of the best performing EMA to-date, that will be used in 
all experiments. Two slightly different implementations of the dominance-based 
crossover operator are then proposed in Section [21 based on BLX-a crossover, 
used throughout this study. Section |31 presents some experimental results wit- 
nessing the acceleration of the progress toward the Pareto set when using the 
proposed mating restrictions. Finally, Section 01 gives some guidelines for a more 
rigorous and complete validation of the proposed strategy, as well as for its 
possible refinements. 

1 Evolutionary Multi-objective Optimization 

Multi-objective optimization aims at simultaneously optimizing several contra- 
dictory objectives. For such kind of problems, there does not exist a single opti- 
mal solution, and compromises have to be made. 

An element of the search space x is said to Pareto- dominate another element 
y if x is not worse than y with respect to all objectives, and is strictly better 
than y with respect to at least one objective. The set of all elements of the 
search space that are not Pareto-dominated by any other element is called the 
Pareto set of the multi-objective problem at hand: it represents the best possible 
compromises with respect to the contradictory objectives. 

Solving a multi-objective problem amounts to choose one solution among 
those non-dominated solutions, and some decision arguments have to be given. 
Unlike classical optimization methods, that generally find one of the Pareto 
optimal solutions by making the initial optimization problem single-objective, 
EMAs are to-date the only algorithms that directly search for the whole Pareto 



set, allowing decision makers to choose one of the Pareto solutions with more 
complete information. 

1.1 Pareto-based Evolutionary Algorithms 

In order to find a good approximation of the Pareto set (a uniform and well 
spread sampling of the non-dominated solutions, close to the actual Pareto set 
of the problem at hand), EMAs have to enforce some progress toward the Pareto 
set while, at the same time, preserving diversity between the non-dominated 
solutions. 

Numerous evolutionary methods have been designed in the past years for 
the particular task of searching for the Pareto set (the interested reader will 
find a good summary in [2]). The best performing among them (NSGA-II [3], 
SPEA2 |16|. PES A £Q) are directly based on the Pareto dominance relation, 
that actually ensures progressing toward the non-dominated set. 

Among the diversity preserving techniques, some were transposed to EMAs 
from single-objective EAs (such as sharing, for instance), while others, like the 
crowding distance described in next subsection, are specific to the multi-objective 
framework. 

Another recognized important feature of EMAs is elitism [T^j, directly related 
to the notion of the Pareto dominance in EMAs: the non-dominated individuals 
can be preserved either by maintaining an archive (SPEA2 and PESA) or by 
using a deterministic replacement procedure (NSGA-II). 

1.2 NSGA-II 

The NSGA-II algorithm has been proposed by Deb et al. in 2001 The progress 
toward the Pareto set is here favored by using a selection based on the Pareto 
ranking, that divides the population into a hierarchy of non-dominated subsets, 
as illustrated by figure Q[a) . All non-dominated individuals of the population 
are first labeled as being of rank 1; they are then temporarily removed from 
the population, and the process is repeated: the non-dominated individuals of 
the remainder of the population are given rank 2, and so on, until the whole 
population is ranked. 

NSGA-II diversity preserving technique is based on the crowding distance 
- one of the possible estimations of the density of the solutions belonging to 
the same non-dominated subset. The crowding distance of each individual i is 
computed as follows: the non-dominated subset to which the individual i belongs 
is ordered following each of the objectives; for each objective m, the distance 
^(m) _ j m (i _|_ 2.) — f m (i — 1) between the surrounding neighbors of individual i 
according to objective m is computed (Fig.^b)); the sum over all objectives of 
these distances is the crowding distance of individual i. 

The following comparison operator >- is then used during the Darwinian 
stages (selection and replacement) of NSGA-II: 
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Fig. 1. NSGA-II comparison criteria 



x >- y iff rank(ir) < rank(y) 

or rank(x) = rank(y) 

and crowding_dist(x) > crowding_dist(y) 
NSGA-II selection is based on tournament: it chooses an individual for re- 
production by uniformly drawing T individuals (generally, T = 2) from the 
population and returning the best of them with respect to the comparison op- 
erator >-. NSGA-II replacement is deterministic: it consists in merging parents 
and offspring together, and choosing the N best individuals in that global popu- 
lation, again using comparison operator >-. The algorithm NSGA-II is said to be 
elitist because the best (less crowded non-dominated) individuals are preserved 
from one generation to another. 



2 Dominance-based crossover 



The basic mechanism of the proposed mating restriction consists in allowing the 
mating of each of the best individuals only with an individual it dominates (if 
any), where best individuals means non-dominated individuals when applying 
NSGA-II, or members of the archive when using SPEA2 ou PESA. 

The rationale behind using the dominance relation to form the couples for 
the crossover is the following. If x dominates y, then x is better than y for all 
objectives. Hence, the direction y — > x is likely to improve all criteria simul- 
taneously. Furthermore, a natural continuation of the same idea is to bias the 
distribution of the offspring toward the dominant parent, as better individuals 
are more likely to be found close to it. However, it is clear that success of this 
idea depends on the behavior of the objective functions in the region of the 
decision space where the mated individuals sit. 

The resulting crossover, called Dominance-Based Crossover (DBX) will pro- 
ceed as follows: a first mate is chosen using the usual selection procedure of the 
EMA at hand (e.g. tournament based on the >- operator for NSGA-II). If the 
chosen individual is non-dominated and dominates some other individuals in the 
population, its mate is chosen among those. Otherwise, the mate is chosen us- 



ing the usual selection procedure. In any case, crossover then proceeds with the 
chosen operator. 

In this study, the well-known BLX-a crossover (0 < a < 1), proposed by 
Eshelman and Schaffer \5\ for real representations, has been used. Formally, 
given two parents (:Ci)ie[i,n] anc ^ (2/i)ie[i,n]) this operator produces an offspring 
by a random linear recombination of the parents as follows: 

((«»)»G[l,n]) (j/i)i£[l,n]) > (<f>i x i + i 1 ~ ^Vi) ie[l,n] ' C 1 ) 

where <pi = U[a,l + a]. In our particular case, given a non-dominated indi- 
vidual (a;t)te[x,nl from the NSGA-II population, two possible strategies will be 
considered to generate the offspring: 

1. Symmetric DBX: The mate (j/i)ie[i,n] is chosen from the list of the in- 
dividuals dominated by (^i)ie[i,nl) if anv 7 by tournament otherwise, and 
tf>i = U[—0.5, 1.5] in Equation y}. 

2. Biased DBX: Similarly, the mate (j/i)ie[i,nl is chosen from the list of the 
individuals dominated by (£i)ie[i, n ], if any, but now 4>i — E/[0.5, 1.5] in Equa- 
tion (I), i.e. the offspring will be closer to the first parent (^i)ie[i, n ]. 

3 Experimental results 

3.1 Experimental conditions 

This section presents some experimental results, on three standard benchmark 
problems |15| and on an industrial problem |12| . All experiments are run with 
population size 100 and tournament size 2. The two DBX crossovers are com- 
pared to the standard BLX-0.5, the crossover rate is set to 0.9. The uniform 
mutation operator is applied with rate 0.05. The algorithms run for at most 150 
(resp. 250) generations for ZDT-benchmarks (resp. the industrial problem). 

3.2 Bi-objective ZDT benchmarks 

For each of ZDT1-ZDT3 test problems 31 NSGAII runs have been performed 
for biased DBX, symmetric DBX and standard BLX-0.5 operators starting with 
the same 31 initial populations. The non-dominated individuals over all 31 runs 
have been calculated at each 10th generation approximately until the moment 
when the whole population is non-dominated, that means that DBX crossover 
is not applied any longer. These snapshots corresponding to biased DBX and 
standard BLX-0.5 are shown in the figures Eland Hfor ZDT1, ZDT2 and 
ZDT3 respectively. On all three test problems, a small but steady acceleration 
of the progress toward the Pareto front is observed when using DBX crossover. 
On those problems, very similar results have been obtained with the symmetric 
and biased DBX operators (the snapshots corresponding to the symmetric DBX 
are not shown for the space reasons). 
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Fig. 2. ZDT1: the black bullets (for the biased DBX) and the gray crosses (for the 
standard BLX-0.5) represent the non- dominated individuals over 31 runs at generations 
10, 20, 30 and 40. 




Fig. 3. ZDT2: the black bullets (for the biased DBX) and the gray crosses (for the 
standard BLX-0.5) represent the non-dominated individuals over 31 runs at generations 
20, 30, 40 and 50. 
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Fig. 4. ZDT3: the black bullets (for the biased DBX) and the gray crosses (for the 
standard BLX-0.5) represent the non- dominated individuals over 31 runs at generations 
10, 20, 30 and 40. 



3.3 Constrained bi-objective optimization 

This industrial problem consists in optimizing the structural geometry, described 
by 13 continuous parameters, of the front crash member of the car in order to 
minimize its mass while maximizing the internal energy absorbed during a crash 
(two competitive objectives) under 8 constraints arising from the acoustic and 
static mechanical domains |12| . 

The constraints have been handled using the so-called infeasibility objective 
approach the aggregated sum of the scaled constraint violations (that can 
be viewed as a measure of distance separating each individual from the feasible 
region) was considered as an additional optimization criterion - the infeasibility 
objective. NSGA-II was hence applied to a three-objective optimization problem, 
with the difference that the infeasibility objective had a higher priority than both 
others in the >- operator. In the other words, every individual located closer to 
the feasible region is considered better than any individual further away from 
it, regardless the corresponding values of the other objectives, the mass and the 
absorbed internal energy. 

DBX operators can be employed in this context exactly like for unconstrained 
problems, using the modified >- comparison. On every run, the restricted mating 
allowed faster approach of the feasible region, as well as faster feasibility of the 
whole population, as illustrated by Figure [5}a. Moreover, some differences with 
the results on the test problems ZDT have been steadily observed. First, biased 
DBX was more efficient than symmetric DBX. Then, and more importantly, 




Fig. 5. Effect of the dominance-based mating restriction in the presence of constraints: 
biased DBX (rings) VS standard BLX-0.5 (crosses) 



the use of mating restriction not only accelerated the search, it also provided 
solutions of better quality at the extremities of the Pareto front, as it can be 
seen on figure [SJ-b. 

4 Discussion and future work 

For all four problems considered in this study, the DBX operators (that only 
allow the mating of dominant individuals with individuals they dominate) have 
been shown to accelerate the progress of the populations toward the Pareto set. 
Moreover, for the optimization of the car front crash member, it also allowed 
finding solutions of better quality at the extremities of the Pareto set that could 
not be reached when using the usual recombination strategy. 

The observed acceleration is relatively small, but is systematic. When ob- 
serving NSGA-II dynamics (the typical NSGA-II evolution process is illustrated 
by Figure EJ), we realize that in fact only a small number of DBX crossovers are 
actually applied. First, note that restricted mating is applied at most as many 
times as there are non-dominated individuals in the population. There are few 
non-dominated individuals at the beginning of evolution (Fig. H3-a,b), but each 
of them dominates a lot of other individuals (Fig. EJb). As the population gets 
closer to the Pareto set, the number of non-dominated individuals rapidly in- 
creases (Fig. EJa,c), but each of them dominates only a few individuals if any 
(Fig.HJc). Finally, when the whole population is non-dominated (Fig.EJd), DBX 
crossover cannot be applied because no individual actually dominates anyone in 
the population! The actual rate of application of the DBX operator is maximal 
about the moment when the half of the population gets the rank 1 (Fig.EJc). 

One possible improvement would be to increase the rate of application of 
DBX operators at the beginning of evolution by mating each non-dominated 
individual with not only one but several individuals it dominates. 

Note that dynamic behavior of the populations described here above and, 
in particular, the disappearance of the dominated individuals is due to the the 
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Fig. 6. Evolution of the number of non-dominated individuals during a run of NSGA-II 

replacement procedure in NSGA-II, and might not be so prominent with other 
EMAs. It is hence worth investigating the use of DBX restricted mating with 
other EMAs paradigms, such as SPEA2 or PESA, for instance. 

Furthermore, there exist other crossover operators used in EAs (such as 
SBX 2 , for example) that could be applied together with dominance based 
restricted mating, instead of BLX-a. 

One more issue, that needs to be thoroughly investigated, is the efficiency of 
the DBX strategy when solving problems with more than two objectives. Indeed, 
in such situations, the "visual" analysis of the populations dynamics performed 
in the present work will not be possible anymore, and our plans include the use 
of so-called running metrics 0]. 

Indeed, the present study could use some performance metrics as well, instead 
of the graphical plots of the population dynamics in the objective space. However, 
in our opinion, such presentation is much clearer and makes things more explicit, 
hence it is more appropriate to the introduction of the new operator. 

But probably the most important issue is to find a way to actually benefit 
from the reported acceleration in practice. Whereas its usefulness is obvious 
for costly applications, where the number of evaluations has to be limited, the 
question remains of how to detect that the algorithm is getting stuck, thus 
saving the computational time that restricted mating allows us to spare. An 
efficient stopping criterion is needed before any actual benefit can be gained 
from the observed acceleration. Such criterion has been proposed by the first 
author |l()lllj . but it was beyond the scope of this paper. Nevertheless, together 



with the DBX operators described in this paper, it allowed an actual saving of 
about 8% of the computation time. However, further evaluation and refinement 
of that stopping criterion are still needed before definite strong conclusions can 
be drawn. 
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